Dynamic language models for interactive speech applications
نویسندگان
چکیده
This work proposes the use of hierarchical LMs as an effective method both for e ciently dealing with contextdependent LMs in a dialogue system and for increasing the robustness of LM estimation and adaptation. Starting from basic LMs that express elementary semantic units, concepts, or data-types, sentence level LMs are recursively built. The resulting LMs may be a combination of grammars, word classes, and statistical LMs. Moreover, these LMs can be e ciently compiled into probabilistic recursive transition networks. A speech decoding algorithm directly exploits the recursive representation and produces the most probable parse tree matching the speech signal. The proposed approach has been implemented for a dataentry task which covers structured data, e.g. numbers, dates, and proper names, as well as free texts. In this task, the active LM must continuously change according to the current status, the active form, and the data entered so far. Finally, while the hierarchical approach results very convenient to cope with this task, it also looks very general and can give advantages in other applications, e.g. dictation.
منابع مشابه
A Comparative Study of Gender and Age Classification in Speech Signals
Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...
متن کاملCombining syntactical and statistical language constraints in context-dependent language models for interactive speech applications
In interactive speech applications the expected vocabulary and the expected user utterances change from one dialogue step to the next one. The use of several context dependent language models results in a better system performance than the use of a single model. In this paper we present a new approach combining syntactical and statistical language constraints to a single language model. Recogni...
متن کاملUnified language modeling using finite-state transducers with first applications
In this paper, we investigate a weighted finite-state transducer approach to language modelling for speech recognition applications. We explore a unified framework to conversational speech recognition which combines the benefits of grammars, n-gram and class-based language models, with the flexibility of using dynamic data, and the potential for integrating semantics. Based on a virtual persona...
متن کاملInvestigation of SLIM Dynamic Models Based on Vector Control for Railway Applications
Although, Single-Sided Linear Induction Motor (SLIM) utilization has increased in railway applications due to their numerous advantages in comparison to Rotational Induction Motors (RIM), there are some sophistication in their mathematical models and electrical drive. This paper focuses on the problems of SLIM modeling, with assuming end-effect on the basis of Field Oriented Control (FOC) as a ...
متن کاملModel-Based, Multimodal Interaction in Document Browsing
In this paper we introduce a dynamic system approach to the design of multimodal interactive systems. We use an example where we support human behavior in browsing a document, by adapting the dynamics of navigation and the visual feedback (using a focus-in-context (F+C) method) to support the current inferred task. We also demonstrate non-speech audio feedback, based on a language model. We arg...
متن کامل